Fault-Tolerant Distributed Match-Making with Weights - Parallel and Distributed Systems, 1996. Proceedings., 1996 International Conference on

نویسنده

  • Amane Nakajima
چکیده

Protocols to solve several distributed issues, such as name service, mutual exclusion, and creation of an atomic shared register, require two types of subsets with intersection property. Distributed matchmaking provides a method of creating the subsets, and the lower bound of the number of messages t o solve the issues. This paper discusses the fault-tolerant and weighted case, in which a protocol is fault-tolerant regarding node failures, and in which weights of subsets are different. The paper j r s t provides the lower bound of the number of messages required for a protocol in a general f o rm. Then, it concentrates a symmetric case and shows the lower bound in a simpler form. The paper also provides a method of constructing the t w o types of subsets, which realize the lower bound. It first shows a method f o r a fully symmetric case, and extends it for other cases. The extended method is prac tical. It creates a cyclic communication structure; and is valid for a n y degree of fault-tolerance and wezghts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault Tolerant Scheduling in Distributed Networks

We present a model for application-level fault tolerance for parallel applications. The objective is to achieve high reliability with minimal impact on the application. Our approach is based on a full replication of all parallel application components in a distributed wide-area environment in which each replica is independently scheduled in a different site. A system architecture for coordinati...

متن کامل

Exception Semantics in a Parallel Distributed Object Oriented Environment

Exceptions have been used to provide a mechanism in object oriented programming languages for assuring program safety and robustness. Although the implementation of exceptions is well understood in a non-concurrent language environment , its use is not as well established when dealing with the requirements of a parallel or distributed setting. Distributed and concurrent programming introduces m...

متن کامل

Parallel Computing in Networks of Workstations with Paralex

Modern distributed systems consisting of powerful workstations and high-speed interconnection networks are an economical alternative to special-purpose supercomputers. The technical issues that need to be addressed in exploiting the parallelism inherent in a distributed system include heterogeneity, high-latency communication, fault tolerance and dynamic load balancing. Current software systems...

متن کامل

Using Peer Support to Reduce Fault-Tolerant Overhead in Distributed Shared Memories

We present a peer logging system for reducing performance overhead in fault-tolerant distributed shared memory systems. Our system provides fault-tolerant shared memory using individual checkpointing and rollback. Peer logging logs DSM modification messages to remote nodes instead of to local disks. We present results for implementations of our fault-tolerant technique using simulations of both...

متن کامل

Availability management of distributed programs and services

Modern distributed applications pose increasing demands for high availability, automatic management , and dynamic connguration of their software systems. This paper presents the architecture of Sampa, a System for Availability Management of Process-based Applications, which aims at fullll-ing these requirements. The system has been designed to support the management of fault-tolerant DCE-based ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1991